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C^l . Abstract 

Q ■ In this paper we present results and conjectures on the van der Waerden numbers w(2; 3, t). We have computed the 
\ exact value of the previously unknown van der Waerden number w(2; 3,19) = 349, and we provide new lower bounds 
for f = 20, 21, . . . , 39, where for / = 20, . . . , 30, we conjecture these bounds to be exact. The lower bounds for 
w(2; 3, t) with f - 24, . . . , 30 refute the conjecture that w(2; 3, f) ^ r as suggested in Based on the known values 
of w(2; 3, (), we investigate regularities to better understand the lower bounds of w(2; 3, t). We also provide heuristics 
, to generate examples of good partitions for w(2; 3, f). Motivated by such heuristics, we introduce palindromic van 
\m/ • der Waerden numbers ^dw(k; fo, ■ ■ . , h-x), which are defined as the ordinary numbers w{k; to,. . . , ft-i), but where 
I only palindromic solutions are considered, reading the same from both ends. Since the underlying property is non- 
(-H ■ monotonic, these numbers are actually pairs of numbers. We compute pdw(2; 3, t) for 3 ^ f ^ 25, and we provide 
bounds for 26 ^ f ^ 39, which we believe to be exact for t ^ 35. All computations are based on SAT solving, and 
we discuss the various relations between SAT solving and Ramsey theory. 
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psj ' 1. Introduction 

> ■ J— 1, 

, We consider Ramsey theory and its connections to computer science (see 04511 for a survey) by exploring a rather 

ff^ ' recent link, especially to algorithms and formal methods, namely to "SAT solving". SAT is the problem of finding 

, a satisfying assignment for a propositional formula. Since Ramsey problems can naturally be formulated as SAT 

' problems, SAT solvers can be used to compute numbers from Ramsey theory. In the present article, we consider van 

, der Waerden numbers, where SAT had its biggest success in Ramsey theory: determination of w(2; 6, 6) to be 11 32 



■ (311, a new diagonal van der Waerden number after almost 30 years. 

The van der Waerden number w{k\ to,ti, . . ., tk-i) is the smallest n such that for any partition {1,2, . . . ,n} - Pq U 
P[ U ■ ■ • U Pk-i there exists a j in {0, 1 , . . . , A; - 1 1 such that Pj contains an arithmetic progression of length tj. A 
good partition of the set {1, 2, . . . , n} corresponding to w{k; fo, fi, . . . , fi_i) contains no block Pj with an arithmetic 
k> ' progression of length tj (for any j). In this paper, we are interested in the specific van der Waerden numbers w(2; 3, t), 
^ , t ^ 3. The known exact values of w(2; 3, t) with f = 3, 4, . . . , 18 are 



9, 18,22,32,46,58,77,97, 114, 135, 160, 186,218,238,279, and 312. 

As references and for relevant information on the above numbers, see Chvatal (9], Brown fl\, Beeler and O'Neil 
101, Kouril 131], Landman, Robertson and Culver 1401, and Ahmed 12, 31. Recently, Kullmann [3513 reported the 
following lower bounds 

w(2; 3, 19) ^ 349, w(2; 3, 20) ^ 389, w(2; 3,21)^ 416. 

We confirm the exact value of vv(2; 3, 19) - 349, and we extend the list of lower bounds up to f = 39. Brown, 
Landman, and Robertson 181, provided the theoretical lower bound w{2; 3, t) ^ fi-o{i))^ ^jj^j observed that w{2; 3, t) ^ 
for 5 ^ f ^ 16. Our experiments show that this assumption is not true for all f. We provide an improved upper 
bound as a conjecture (satisfying all known values and lower bounds of w(2; 3, f))- 



the conference article 13611 contains only material related to Green-Tao numbers and SAT 
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1.1. Using SAT solvers 

As explored in Dransfield et al. ill], Herwig et al. Kouril Jsil], Ahmed HH, and Kullmann SH, we 



can generate an instance F of the satisfiabiUty problem (for definition, see any of the above references) corresponding 
to w{k; t(),t\, . . . , tk-\) and integer n, such that F is satisfiable if and only if n < w{k; to,ti,. . ., tt^i). In particular, an 
instance corresponding to w(2; 3, t) with n variables can be generated with the following clauses: 

(a) [xa, Xa+d, Xa+2d} with fl ^ l,d ^ 1, fl + 2(i ^ «, and 

(b) {Xa, Xa+d, ■■■ , Xa+d(t-l)} with fl ^ l,d ^ 1 , fl + (i(f - 1 ) < M, 

where x, = s encodes / € for e e {0,1} (if x, is not assigned but the formula is satisfied, then / can be arbitrarily 
placed in either of the blocks of the partition). Clauses (a) prohibit the existence of an arithmetic progression of length 
3 in P() and clauses (b) prohibit the existence of an arithmetic progression of length f in Pi . 

To check the satisfiability of the generated instance, we need to use an algorithm that either solves the instance 
and provides a satisfying assignment, or says that the formula is unsatisfiable. We have at our disposal two kinds 
of algorithms: complete and incomplete. A complete algorithm like DPLL (see [12. ,1 1,1 : Algorithm [T] is a slightly 
modified variant given in |j3|]) finds a satisfying assignment if one exists, and otherwise correctly says that no satisfying 
assignment exists and the formula is unsatisfiable. SAT solving has progressed much beyond this simple algorithm 
(see the handbook [§] for general information), however on this special problem class this basic algorithm together 
with a basic heuristics (see Subsection 12. lb is very competitive; see Section|5]for more information on SAT solvers in 
this context (and see lISOll for applications of SAT to combinatorics). 



Algorithm 1 DPLL algorithm 

1: function DPLL(F) 

2: while True do 

3: if (m) 6 F and {u] e F then return Unsatisfiable 

4: else if there is a clause {v) then F - F\v 

5: else break 

6: end while 

7: it F -d) then return Satisfiable 

8: Choose an unassigned literal u using a branching rule 

9: if DPLL(F|m) = Satisfiable then return Satisfiable 

10: if DPLL(F|m) = Satisfiable then return Satisfiable 

11; return Unsatisfiable 

12: end function 



Given a formula F and a literal u in F, we let F\u denote the residual formula arising from F when u is set to true: 
explicitly, this formula is obtained from F by (/) removing all the clauses that contain u, (ii) deleting u from all the 
clauses that contain S, (///) removing both u and u from the list of literals. Each recursive call of DPLL may involve 
a choice of a literal u. Algorithms for making these choices are referred to as branching rules. It is customary to 
represent each call of DPLL(F) by a node of a binary tree. By branching on a hteral u, we mean calling DPLL(f |m). 
If this call leads to a contradiction, then we call DPLL(F|m). Every node that is not a leaf has at least one child and 
may not have both children. We refer to this tree as DPLL-tree of F. 

Local-search based incomplete algorithms (see Ubcsat-suite lH^) are generally faster (as they try to find a satis- 
fying assignment as fast as possible when we can control the heuristic, number of iterations, runs, and several other 
parameters) than a DPLL-like algorithm, but may fail to deliver a satisfying assignment when there exists one. A good 
partition is a proof of a lower bound for a certain van der Waerden number irrespective of its means of achievement. 
So incomplete algorithms are handy for obtaining good partitions and improving lower bounds of van der Waerden 
numbers. When they fail to improve the lower bound any further, we need to turn to a complete algorithm. 

1.1.1. Parallel/distributed SAT solving 

The problems we consider are computationally hard, and for the hardest of them in this paper, computation of 
w(2;3, 19) - 349, a single processor, even when run for a long time, is by far not enough. Hence some form of 



2 



parallelisation or distribution of the work is needed. Four levels of parallelisation have been considered for SAT 
solving (in a variety of schemes): 

(i) Processor-level parallelisation: This helps only for very special algorithms, and can only achieve some relatively 
small speed-up; see |23] for an example which exploits parallel bit-operations. It seems to play no role for the 
problems we are considering. 

(ii) Computer-level parallelisation: Here it is exploited that currently a single (standard) computer can contain up 
to, say, 16 relatively independent processing units, working on shared memory. So threads (or processes) can 
run in parallel, using one (or more) of the following general forms of collaboration: 

(a) Partitioning the work via partitioning the instance (see below); lIsilEsI] are "classical" examples. 

(b) Using the same algorithm running in various nodes on the same problem, exploiting randomisation and/or 



sharing of learned results; see 11251 11711 for recent examples . 
(c) Using some portfolio approach, running different algorithms on the same problem, exploiting that various 

algorithms can behave very differently and unpredictably; see lfl6ll for the first example. 
Often these approaches are combined in various ways; see for recent examples. Approaches (b) 

and (c) do not seem to be of much use for the well-specified problem domain of hard instances from Ramsey 
theory. Only (a) is relevant, but in a more extreme form (see below). In the context of (ii), still only relatively 
"easy" problems (compared to the hard problems from Ramsey theory) are tackled. 

(iii) Parallelisation on a cluster of computers: Here up to, say, 100 computers are considered, with restricted com- 
munication (though typically still non-trivial). In this case, the approach (ii)(a) becomes more dominant, but 
other considerations of (ii) are still relevant. For hard problems this form of computation is a common approach. 

(iv) Internet computation, with completely independent computers, and only very basic communication between 
the centre and the machines: Here only (ii)(a) is applicable, and, in principle, the number of computers is 
unbounded; yet there is no real example for a SAT computation at this level. 

We remark that the classical area of "high performance computing" seems to be of no relevance for SAT solving, since 
the basic SAT algorithms like unit-clause propagation are, different from typical forms of numerical computation, 
inherently sequential (compare also our remarks to (i)). 

Now a major advantage of the DPLL solver architecture discussed in Subsection 11.11 (which has been further 
developed into so-called "look-ahead" SAT solvers) is that the computation is easily parallelisable and distributable: 
Just compute the tree only up to a certain depth d, and solve the (up to) 2'' sub-problems at level d. Only minimal 
interaction is required: The sub-problems are solved independently, and in case one sub-problem has been found 
satisfiable, then the whole search can be aborted (for the purpose of mere SAT-solving; for counting all solutions of 
course the search needs to be completed). And the sub-problems are accessible via the partial assignment constituting 
the path from the root to the corresponding leaf, and thus also require only small storage space. This is the core of 
method (ii)(a) from above, and will be further considered in Subsection l2.1l (for our special example class). 

In the subsequent subsection we will discuss the general merits of applying SAT solving to (hard) Ramsey prob- 
lems. One spin-off of this combination lies in pushing the frontier of large computations. As a first example we have 
developed in l'22'l, motivated by the considerations of the present article, an improved method for (ii)(a), which is also 
relevant for industrial problems (typically from the verification area). One aspect exploited here is that for extremely 
hard problems splitting into millions of sub-instances is needed. In the literature until now (see above for examples) 
only splitting as required, by at most hundreds of processors, has been performed, while it turned out that the above 
"extreme splitting", when combined with "modem" SAT solvers, is also beneficial in less extreme settings, and on 
a wide range of examples. See [37] for the elaboration of this method in the context of the problem classes of the 
present article. 



1.1.2. Synergies between Ramsey theory and SAT 

For Ramsey-numbers (see [44] for an overview on exact results), relatively precise asymptotic bounds exist, and 
due to the inherent symmetry, relatively specialised methods for solving concrete instances have an advantage. Van- 
der-Waerden-like numbers seem harder to tackle, both asymptotically and exactly, and perhaps the only way ever to 
know the precise values is by computation (and perhaps this is also true for Ramsey-numbers, only more structures 
are to be exploited). SAT solvers are especially suited for the task, since the computational problems are hypergraph- 
colouring problems, which, at least for two colours, have canonical translations into SAT problems (as only considered 
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in this paper). For more colours, see the approach started in 113611 . while for a general theory of multi- valued SAT close 
to hypergraph-colouring, see [38, 39]. 

Through SAT computations (as in the present article), Ramsey theory acquires an applied side. Whenever for 
example a recent microchip is employed, this likely involves SAT solving, playing an important (though typically 
hidden) role in its development by yielding the underlying "engines" for its verification. See the recent handbook H 
to get some impression of this astounding development. We believe that problem instances from Ramsey theory are 
good benchmarks, serving to improve SAT solvers on hard instances: 

• Unlike with random instances (see m for an overview), instances from Ramsey theory are "structured" in 
various ways. 

• In this paper, we consider two instance classes: instances related to ordinary van der Waerden numbers w(2; 3, f) 
and instances related to the palindromic forms pdw(2; 3, t). Now already with these two classes, the two main 
types of complete SAT solvers, "look-ahead" (see Kill ') and "conflict-driven" (see lH^l), are covered in the sense 
that they dominate on one class each (and are (relatively) efficient); see Section |5] for further details. On the 
other hand, for random instances only look-ahead solvers are efficient (for complete solvers). 

• Like with random instances we get scalability, though due to the possibly large and unknown growth of Ramsey- 
like numbers, the control of the parameters is more complicated here. Thus it is only through computational 
studies like this paper, serving to calibrate the scale via precise numerical data, that the field of SAT instances 
from Ramsey-theory becomes accessible. 



• Especially for local-search methods (see 112911 for an overview), these problems are hard, but not overwhelmingly 
so (for the ranges considered), and thus all the given upper bounds can trigger further progress (and insight) 
into the solution process in a relatively simple engineering-like manner. 

• On the other hand, for lower bounds we need to show unsatisfiability, which is much harder All applica- 
tions of SAT solving in hardware verification are "unsatisfiability-driven" (see ^ [32I1 for introductions). So 
future progress in solving hard Ramsey instances might have the potential to trigger a breakthrough in tackling 
unsatisfiability, and should then also improve these industrial applications. 

We believe that for better SAT solving, established hard problem instances are needed in a great variety, and we 
believe that Ramsey theory offers this potential. To begin the process of applying Ramsey theory in this direc- 
tion, problem instances from this paper (as well as related to j36ll ) have been used in the SAT 2011 competition 
(httpi/Zwww. satcompetition.org/2011/). As akeady mentioned in the previous subsection, the first fruits of 
the collaboration between SAT and Ramsey theory appeared in [22, 37], yielding a method for tackling hard problems 
with unprecedented scalability. 

Finally the interaction between Ramsey theory and SAT should yield new insights for Ramsey theory itself: 

1. The numerical data can yield conjectures on growth rates; see Subsection l2.4l 

2. The good partitions found can yield conjectures on patterns; see Section[3] 

3. New forms of Ramsey problems can be found through algorithmic considerations; see Section|4] 

4. The SAT solving process, considered in detail, acts like a microscope, enabling insights into the structure of 
the problem instances which are out of sight for Ra msey th eory yet. For approaches towards structures in SAT 
instances which we hope to study in the future see 



imseytr 



1.2. The results of this paper 

Section |2] contains our results on the numbers w(2;3,0- We discuss the computation of the one new van der 
Waerden number, and present further conjectures regarding precise valued and the growth rate. In Section |3] we 
investigate some patterns we found in the good partitions (establishing the lower bounds). 



^to establish these conjectures will require major advances in SAT solving 



4 



In Section H) we present a new type of van-der-Waerden-like numbers, namely palindromic numbers, obtained 
by the constraint on good partitions that they must be symmetric under reflection at the mid-point of the interval 
{!,...,«}. Perceived originally only as a heuristic tool for studying ordinary van der Waerden numbers, it turned out 
that these numbers are interesting objects on their own. An interesting phenomenon is that we no longer have the 
standard behaviour of the SAT instances with increasing n, where 

• first all instances are satisfiable, and from a certain point on (the van der Waerden number) all instances are 
unsatisfiable, 

• but now first again all instances are satisfiable, then we have a region with strict alternation between unsatisfia- 
bility and satisfiability, and only from a second point on all instances are unsatisfiable. 

These two turning points constitute the palindromic number pdw(2; 3, f) as pairs of natural numbers. We were able 
to compute pdw(2; 3, f) for t ^ 25, exploiting now a diff'erent type of SAT solver, sometimes euphemistically called 
"modern SAT solvers" due to their dominance for industrial applications, the many ways to optimise them, and their 
erratic way of "brute-force intelligent learning". We also provide (conjectured) values for t ^ 39. 

Finally in Section |5] we discuss the observations on the use of the various SAT solvers involved. In [Appendix A| 
we show how to access all the numbers and many methods in the DKlibrary, an open-source research platform for 
hard problems (around the SAT problem). 

In this paper, we represent partitions of w(2; 3,0 as bitstrings. For example, the partition f o - (1,4, 5, 8) and 
Pi = {2, 3, 6, 7}, which is an example of a good partition of {1, 2, . . . , 8), where 8 = w(2; 3, 3) - 1, is represented as 
01100110, or more compactly as 01^0^ 1^0, using exponentiation to denote repetition of bits. 

2. Computational results on w(2; 3, t) 

This section is concerned with the numbers w(2;3,f)- The discussion of the computation of w(2;3, 19) is the 
subject of Subsection 12. II Conjectures on the values of w(2; 3, f) for 20 ^ f ^ 30 are presented in Subsection 12.21 
while further lower bounds for 31 ^ f ^ 39 are given in Subsection 12.31 Finally in Subsection 12.41 we update the 
conjecture on the (quadratic) growth of w(2; 3, t). 

2.1. w(2;3,19) = 349 

The lower bound w(2; 3, 19) ^ 349 was obtained by Kullmann fs^l using local search algorithms and it could 
not be improved any further using these incomplete algorithms (because, as we now know, the bound is tight). An 
example of a good partition of the set {1, 2, ... , 348) is as follows: 

l''0l''0l"*0l'01^0l'01''0l"0l'0l'0l'^0l'0l'0l'*'01 "^0101-0^ l-0l'^01^0l"0 
l''01^01^0l'0l"'0l"01^0l''0l'^01''0l''01^0^1^010l''0l'^0l"'0l'0l"'01^ 

To finish the search, i.e., to decide that a current lower bound of a certain van der Waerden number is exact, might 
require many years of CPU-time (perhaps trillions of years for the bigger lower bounds). Discovering a new van der 
Waerden number has always been a challenge as it requires to explore the search space completely, which has a size 
exponential in the number of variables in the corresponding satisfiability instance. To prove that an instance with 
n variables is unsatisfiable, the DPLL algorithm has to implicitly enumerate all the 2" solutions. So the algorithm 
systematically explores all possible solutions, however without actually explicitly evaluating all of them — herein lies 
the strength (and the challenge) for SAT solving. 

Dividing the computation of a search into parts that have no inter-process communication among themselves is 
straightforward. DPLL has this desirable property like many backtrack algorithms. We may pick a level, say I, of the 
DPLL-tree and distribute the subtrees rooted at that level among the processors. The appropriate value of { may depend 
on many factors like the number of computers available, and should be decided on a case-by-case basis. Distributing 
the tasks evenly will not guarantee the reduction of computation time by a factor close to the number of CPUs, as 
some of the subtrees may be considerably larger than others, or that a computer involved may be relatively slow. 
Advanced splitting techniques for problem-specific DPLL-tree would help to predict which branch would require to 
be splitted recursively. In Subsection II . 1 . II we gave an overview on this area from a general SAT perspective, and we 
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are concerned here with method (ii)(a). We discuss now the special structures found for the concrete instance under 
consideration. 

The size of the DPLL-tree greatly varies with the choice of the branching rule (or heuristic). It is hard to find 
a branching rule that works good on every SAT instance. On a formula F, define w(m) - Y,k '^ '^dk(u), where dk{u) 
denotes the number of clauses of length k containing u. First we find a variable x that maximises w{x) + w{x), and then 
we choose x if w{x) ^ w{x), and x otherwise. This branching rule is known as Two-sided Jeroslaw-Wang (2sJW), (by 
Hooker and Vinay ll24ll ) in the literature. Our DPLL implementation reads branching suggestions, if there is any, up 
to a prescribed level, and then explores the search space corresponding to the residual formula after those branches. It 
also periodically stores the current state of the search as a sequence of pairs (m, f) where u e {x\,xi,X2, X2, . . ., x„, x,,} 
and f e {0, 1}. A pair (m, 0) indicates that u has to be explored, and (u, 1) indicates that u has been explored akeady. A 
generator version of the solver generates the branching suggestions up to prescribed level, and then the solver runs on 
different suggestions corresponding to the respective branches of the DPLL-tree. This involves a slight modification 
in Algorithm[T] which can easily be done. By choice of this branching rule on the formula corresponding to w(2; 3,19) 
with n - 349, for a certain value of the parts (sub-trees of the DPLL-tree) Pq, P\,P2, Pa, P&, P\6, ■ . ■ are bigger than 
the others parts, and Pq is the biggest. This observation was made in Ahmed [jst] and the pattern in the sizes of the 
branches of the DPLL-tree helps to plan recursive splitting of the DPLL-tree to control the actual run-time. 

We have proved that there is no good partition of the set {1,2,..., 349) with respect to w(2; 3, 19), and hence the 
exactness of the lower bound w(2; 3, 19) ^ 349 given by KuUmann li35J . We have used 2.2 GHz AMD Opteron 64-bit 
processors (200 of them) from the cirrus cluster at Concordia University for running the distributed branches of 
the DPLL-tree corresponding to w(2; 3, 19) with n - 349. Roughly 196 years of CPU-time has been spent. We have 
splitted the search space into 256 independent parts Pq, Pi, ... , f 255 with ^ = 8. Part Pq alone has taken roughly 60 
years of CPU-time. Other parts, which were making slow progress were also splitted recursively. 

In such a large computation, where thousands of distributed branches of the DPLL-tree have run on hundreds 
of processors for almost a year, we hope we have not fallen into the trap of an undetected hardware failure (an 
electricity failure is natural and every detected hardware -failure was re-run from the last state of the search), or a file 
manipulation error on a particular branch which could contain a good partition of the set {1,2,..., 349}. Of course, 
also a software error is possible. We welcome interested readers with proper resources to conduct another search 
to verify our result. However, we have no doubt that w(2; 3, 19) - 349 holds true, since not only do we have the 
completed run of the first author's dedicated solver, but also the data gathered by the second author, using local search 
algorithms, gives strong evidence: solving for n - 348 is very easy, and in all other similar (smaller) cases this sudden 
change of behaviour has been validated as been caused by unsatisfiability. We would also (again) Uke to point out that 
it seems that there is no alternative method available, and perhaps there will never be. SAT solvers have a long and 
distinguished history, and the connections to formal methods and verification gives a strong background for correct 
and efficient implementations. Definitely no less than for example computer algebra systems, but likely more so due 
to the non-numerical and inherently simple nature of the task. See Section |5] for further remarks on SAT solving for 
these instances in general. Finally, we remark that it would be highly desirable to be able to substantially compress 
the resolution proofs obtained from the solver runs, so that a proof object would be obtained which could be verified 
by certified software (and hardware); see fio'l for some recent literature. 



2.2. Some new conjectures 

In this section, we provide conjectured values of w(2;3,0 for t = 20, 21, . . . , 30. We have used the Ubcsat 
suite [I49] of local-search based satisfiability algorithms for generating good partitions as a proof of the lower bounds 
we provide. See Section |5] for details of the algorithms used to find the good partitions. Since local search based 
algorithms are incomplete (they may fail to deliver a satisfying assignment, and hence a good partition when there 
exists one), it remains to prove exactness of the numbers using a complete satisfiability solver or some complete 
colouring algorithm. 

2.2.1. 1^(2; 3, 20) ^ 389 

l''0l''01''0l'{)101''0l'-H)l'{)1^0I0l'*0-1^0-l{)l''0l'*0l''01''0I0l'^01-'0l'°0l''*{)l'*0l'''0l'°01-'0l'0101''0l'' 
0l"'0l''010^1-'0^l'''0101''0l'0l'-^01*010l'01*0l"01-° 
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2.2.2. w(2;3,21) ^ 416 

1*01 "010l''010l'Ol'^0^l''0l'''01-^O10l20l'-0^ 1^0- 101 '°oi'*oi-^oio^i*oi'^oi-°oi'*oi^oioi'oi''oi'-o^ 
101 "01 "'Ol-'OlO-l^Ol^Ol'O-l'^Ol'Ol "0101*0101 "01-' 

2.2.3. w(2;3,22) ^ 464 

j2q2j17q2j9qj12qjqj12qj15qj2qj1C1qj4q[7qj5qj12qj9qjqj3qj8qj3qjqj9qj12qj5qj7qj4qj10qj2qj15qj12 

0101 '-0l''0-l"0-l''0l'-0101 '-01 "01-01 "'oi-'oi'oi'^oi'-oi''oioi-'oi*oi^oi''oi'-oi'^oi'oi" 

2.2.4. w(2;3,23) ^516 

l''0-1^0l"01''010l"0l"'01''0l'01-''0101^01*0^101''0l"01-01''0l"^0l''0l"'010l''0l"'0l'0l"0l'^010l'''0l"^0 

j2lQ2jl9Q,6Qj2ml2QjQ2j4QjQj20Qjl3Q2jllQj9Q2j6yj4g,13Qjm3m8Q[<)Qj20Qj5ml8m3Q[ 

2.2.5. w(2;3,24) ^ 593 

j21qj1Soj16qj4oj7oj6qjqj14qj3q2jSqj7qj3qj2qj20o2j3qj7oj15qj7qj3q2j20qj2qj3qj7qj9qj18qjqj6qj21qj7q 

l"'0l'01^'0l'^010l"'0l''0l'0l'01^01^''0^l'0l'0l'^0l'0l'0^1™01^0l'0l'01*0^l'0l"'010l'*0l'01''0l"^0l"'0p' 

2.2.6. w(2;3,25) ^ 656 

l"01-0l"0l''0l'0l'''0l''01''010l'0l"0l"'01^'0^1^010l"'0f0l'^01''01^0l'*0f0l"01^0l'^01''0l"0101^ 

Qj6Q2j6Qjl7QjaQj7Qjl3Qjl4Qj2Qj4Q|13Q2|aQj7Qjl9Q]8Qj4Q,Q[7Qjl5Qjl0Qj9Q|llQ2j2QJQ[lOQ[5Qjl4Q 

i*oi-^oi'oi'oi"oi-'oi'-'oi''oi"oioi-'oif'o-i-''oi''oi'' 

2.2.7. w(2;3,26) ^ 727 

l"'01--\)l'°0101™01-'0l"01^'0l"02l-01^01<'01^01--\)l''010l'0l"0l"0101 "0-1^0101-^01*01-01'* 
01^01^01 '•'Ol '^01 '"01*01 ■''01*010- 1*01 '-\)l'*0101-"01''0l'*0l"0-l-0l'0l''01^0l''01*010l'0l'^010l"^ 
0101 "O^ 1-0101-^01*01-01 '*0l''0l'*0l'201 '"01*01 '''oi*oi2oi"*oi-^oi-^ 

2.2.8. w(2;3,27) ^ 770 

i-*oi-\)i'oi'*oi'^ioi^oi-'oi-'oi'oi-oi^°o-i'-oi"oi'°oi"oi-\)i''oi"oi'-'oi^-oi^o^ 

l*01^01-''0l"010l"'0l'01^0l'010^101^'01^0^l'*0l''0l"0101-01-'0l"0l'01^0101-01^°0-l'^0l'^01^^ 
01-'0l''0l"01 "01^-01^0^ l*0l'01™01*0l'"0l'01-'0l'010^10l"0l"01*01-°0l'-'^0l"'0l' 

2.2.9. w(2;3,28) ^ 827 

i-'oi"'oi--oioi'"oi'^oi'"oi-''oi*oi'"oioi"o-i^oioi^oi"o-i'*oi'oi^oi'*oi'-oi-'oi'*o 

l'-^010-l*0l2^01*0101-^0l"'0l"0l'0101-^01*0l'0l"0l"'010l'0l"0l"01*0l'^0^1-0101-'0l' 
0l'*01^01^0l'*0l'^0l'"01*0l'*0l'-^010-l"*0l''01*0101=^0l"*0l"0l''01*0101--^0l"'0l''0l'-0l'"0l'" 

2.2.70. w(2;3,29) ^ 868 

j15qj21qj18qj17qj18,)[21,)[2qj7qj25qj,)[12,)[17qj6qj7q2j2qj17qj5q2jqj10qj6qj13qjqj2qj17qj16qj7qj4 

01™0101-0l'0l''01-'0l'°01^'01^0-l'"01-'0l'°01*01-°0101-0l"01-''0l'*0- 1-01 ''01^0- 101 '°0l"01 '-'0101- 

oi''oi'oi''oi<'oi'oi*oi-°oioi-oi"oi^'oi^oi'oi^'oi-'oi'"oi"oi'-oi-'oi"oi"oi* 

2.2.77. w(2;3,30) ^ 903 

i'-oi'"oi*2oi-"oi'oioi*oioi2-oi'-oi'"oroi"oi''oi''oi"o'i''o'i''oi'oi*oioi-'oi"oi2*oroi-^i*oi-- 

01*0101-o2l"0101^^0l"0l'*01-'01 "01*01 '"0'l"0101^01-*0l"0^ 101 "'01^0l'*0l'0l"'01-'0l"0l2* 
0l'01^0l20-l'*0l201*010l'0l"010l'"0l2^0101*01-'0l"'01*0^l"0l'-01^0l"0l'°0l'* 

We observe that for f = 24, 25, . . . , 30 we have w(2; 3, t) > fi, which refutes the assumption that w(2; 3, t) ^ fi, as 
suggested in based on the exact values for 5 ^ f ^ 16 known by then. 
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2.3. Further lower bounds 

2.3.1. w(2;3,31)>930 

l'^0l"0l'*0l"*0l'01^01''0l"0l'^0l"'0l'*0^l"'01''0l"01*0l'010l"'0l"0l"0l"'01*01^01*0101''01*0l" 

Ol'''0l'0l'*01*01^'0l'''0l'''01^0l''0l'''0l'^0l"01^()l"0l'-^0l''0l''0l'0l"'0l"01^'01''0l"'01^0l"''0l'''0l'' 
01''010l''01^0l'*0l"'0l"0l"0l"'0101^0l'*0l"01''0l"'0^l"*0l"'0l'^0l"01''0p0l''0l'"0l"'0l'''0l'^ 

2.3.2. w(2;3,32)> 1006 

jl5gj26oi5QjllQj4Qjl4Qj4gjl6gj28oi22Qj2gj5Qj7gjl9gj2QjQjl6Qjl3oi5gj3Qj7Qj4Qjl3Qj9Qjl4Qj28Qj7Qjl5QjlO 

OlO^l'*0^P0l"0l'^OP'01^'01^OlOl*0l'Ol"'Ol'^O1^01^''0l'"O1^010l"0l"'0^1''0p'*0l''Ol"O^l*Ol'*0l'O 

jllgj3Qjl0QjQ2j28Qjl4Qjl7Qj3gjl2gjl3Qj5gjl6gjl3gj3Qjl0Qj22Qj2gjl2Qj7gj28gj20gj2Qj5gj4Qj22Qj8gj28 

2.3.3. w(2;3,33)> 1063 

j29gj6Qjl4QjllQj2lQjl4gj2Qjl8QjQjl5Qjl2Qj25oil6gjgjllg2j4gjlOgj5Qjl3Qjl6Qj20gj2Qjl3Qj22Qj5gj4QjQjl9oi9g 

l''Ol''0l''010l'''0l"0^1*Ol'ol*01^01^*0l'^Ol'0P''01''Ol'''Ol"'0^l'*0l'''010l'0l"0l"'01^*0l'^Ol^''01*Ol''0l' 
01"01''01'«01''01''01''01^01*'01"010P01"'01'''01'«0^1^01"'01^01'''0P01"'01"01"*01''01''01^''01''01'''01" 

2.3.4. w(2;3,34)> 1143 

P^Ol'0101*01^'01^'OPO^l'°Ol'Ol"01^0l'01^0l'*'Ol'^01™0^1^0l'0^10P'Ol'*01*'Ol'^01^0l"Ol'Ol''0^1^*'01* 

01^0101^'' o^i'oPoi'oP'oP''oi'"oi"oP''oi'''oi''*oi'^o^P'*oi'oi'oi'''oi"oi()^i-''oioi''oi'oi^oi-''o^i^*oi'oi'''o 

l'0l"'01*01^'*0l'010^101^0l"01^0l'0l'^01^0l''01^^0l"'01^01^'01^010l''0l"0l"'0l"01^''0l'^0l"'0l'01^0P'' 

2.3.5. w(2;3,35)> 1204 

j34Qj24Qj8gj22gj3Q2jl0Qj29Qj7gj4Qjl4Qjl5Qj7gj2lQj6gj5Qjl3Qj22QjQjl2Qj2gjl2gj7QjllQ2j4gj3Qj7gjl8gj5Qj3gj4Q 

1^0l'^Ol"Ol''01*010l'Ol"Ol'°OPOl'*01^0l''0101^'Ol'*01^01^^0^1^0l'01^'Ol'''01^*01^0l"ol()l''()l*0101-''Ol'^01^ 
01*01*'0l'01^'0l'^0l"0l"01^^010l'*0101^0l"0l"'0l'0l"010l"'0l'0l''0l"01''0l"01^''0l'^01^'01^^0l'*0^1^0P' 

2.3.6. w(2;3,36)> 1257 

jlOQj33gjl6gjl2gj6QjllQj25gjgj4gj5Qj2gjl2Qj6gj29gjgj4Qjl7Qj26Qj2lQjQjl2Qj3gjgj2Q2jl3Qj5gjl6gj32gj2Qjl9o2ig 

1*01^01^^01 '"Ol^^Ol^Ol^Ol^Ol'Ol'^Ol ''01 '""Ol^Ol^Ol '''01 '-^OI'OI '-01-^01 '4)ioi-^*oi'oi'°oi™oi'oi^'^oi-''^ 
l<'01^0P^01^0l'''01^01^''Ol''OP01^''Ol''*Ol'''0^ 101*01^01 ''Ol'ol''OP'01^''0101^0l"'01^''01^01''0^l'^01''01^01*010 

j22gjl6Qj20Qj28QjQj34 

2.3.7. w(2;3,37)> 1338 

j5gj33Qj22Qj2gjl2Q2jl8Qjl4gj9gjl2Qjl5Qj24Qjl9Qjl7Qj7Q2j7Qjgjl2Qj2ojlOQj22Qj8Qj32Qj3Q[6Qjl9Qjl4Qj2lQ2j2Qjl2 

0l'0I^''0I^20l20101^0I^20l'0l"'0l"0l"0l"'0^I^0101^0l'01^-0l'^01-^0101-''0^1''0l''0l''0l"0l'0l'*01^'01''0l' 
0l"'0P'0l'0l"'0l"01^01''0101^^0l"'0l"0l'^010l"0p0l'^0l'0l'*0p0l"'0l"0P°0^1^0101*01^01''0l"010^l'0 

l29gi4ojl4ojl6gil8gil4gj21g2jllgj6 

2.3.8. H'(2;3,38)> 1378 

l^''Ol'''Ol'Ol"01^^010l'^01^'Ol'^Ol"010^I^«01''01^01^''Ol"'Ol'01^''01''Ol"'Ol''01^0l'Wl^'Ol"'Ol'3oroi" 

Q2j4Qj26Qj3gjQjlOQj20Qjl6Qjl8Qj9gjl2gjm2lQj6Qj37Qj(,Qjl5Qjl3Qj6Qjl2Qj8Qj32Qj4Qj5Qjl9Qj3gjlOgj2lQ2j4Qjl8 

Qj28Qj37Qj6gjllQj3gjl0gj7Qjl3Q2j4Qj34Qj5gj4Qj25gj5Q2j3Qj27Qjl0Qj5gj23gj4Qj22Qj8gjl2Qjl6Qjl3Q2j4Qj20Q 

l"'0l''0l'010l"0l'0l"0l''0101*'0^l"0l'^0l'' 
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2.3.9. w(2;3,39) > 1418 



1^01''0l"0l'''0101™01^'0l"0l'0l20l'''01^°01^0^l''01^''0l'-01^01-'0l'*0l"0l'0l''01^°01^0l"^01''0l2'^01* 
0^l2''01^01''01^0l''0l'*01^''0l"01^0l'0l"^0l"01^01^01^-01^0l'^01''0l''*01^''01*0l"0l''0l"010l'''01^'0l"^ 
0l'01-0I^0l''^0l''0l'0l'01^0l''0I^()l-*0l"0l"0101-0l'0l'0l''010^1^'01^-0^l"01-''01-*0l'^01-'0^l'^ 
FOl-'-Ol-O- 1^01-01 "0I-'0l'0I''01-''0l'*0I--01''0l'^01 '^01^' 

2.4. A conjecture on the upper bound 

An important theoretical question is the growth-rate of f w{2\ 3, t). Although the precise relation "w(2; r, t) ^ 
f^" has been invalidated by our results, quadratic growth still seems appropriate (see [35] for a more general conjecture 
on polynomial growth for van der Waerden numbers in certain directions of the parameter space); 

Conjecture 2.1. There exists a constant c > 1 such that w(2; 3, t) ^ ct^. 

By observing the known exact values, computed conjectures, and lower bounds, we may arrive at the following 
recursion: 

w(2;3,f) w(2;3,f- l) + t/(f- 1), 
for f > 3 with w(2; 3, 3) = 9. From our data, we observe that d ^ ^'^^23^^ ~ Solving the recursion, we get 

w(2;3,0 < 1.675(f2 - f) - 1-05 < 1.675f^ 

which satisfies all data regarding w(2; 3, f), presented so fai\ 

3. Patterns in the good partitions 

In this section, we investigate the set of all good partitions corresponding to certain van der Waerden numbers 
w(2; 3, f) for patterns. Let 5(f) denote the set of all binary strings each of which represent a good partition of the set 
{1,2,..., w(2; 3, f) - 1 )■ Generating S (f) involves traversing the respective search space completely. The lower bounds 
reported in the previous section were obtained using local search based satisfiability algorithms, where we stopped 
after the first satisfying assignment corresponding to a good partition is found. On the other hand, to generate all good 
partitions, we need to use a complete algorithm to visit the total search space. So we restrict our search for patterns in 
3 ^ f ^ 14 (with exceptions in Table|5] where t - 15, . . . , 20 and 24 are also considered). 

Let noiB), «i (B), and «oo(B) be the number of zeros, ones, and double-zeros, respectively, in a bitstring B (note that 
three consecutive zeros are not possible in a B e S{t)). Let HP IS (B) denote the sequence of powers of 1 inabitstring B. 
Let np(B) and n^iB) denote the number of peaks (local maxima) and valleys (local minima), respectively, in EPIS(B) 
(not necessarily strict). For example, for the compact bitstring l'*001*'01^0l'01^001^01^01^001^0l'01^01^001** (with 
no = 16, «! = 60 and noo = 4), we have the following EPIS, with p and v, marking peaks and valleys, respectively, 
corresponding to changes in magnitudes. 

8631358531368 
p V p V p 

And for B = l^Ol^Ol^Ol^Ol^Ol^ we have noiB) = 5, ni(B) = 12, noo(B) = 0, while there is one valley followed 
by one peak, and thus ny(B) - n^{B) - 1. 

3.1. Number ofO's and OO's 

In this section, we count the number min{«o(B) : B e 5(01, max{no(B) ■ B E 5(01, and max{noo(B) : B e 5(f)} for 
3 < f ^ 14. Observations in Table[T]lead us to Coni ec tures 13.11 and 13.21 
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Table 1 : Zeros in good-partitions of { 1 , 2, . . . , w(2; 3, ~ 1 1 



w(2' 3 t) 


max|no(B) : B e S(r)| 




w(2;3,3) 


(4, 4) 


2 


vf(2;3,4) 


(6, 6) 


2 


vf(2; 3,5) 


(7, 9) 


2 


vf(2; 3,6) 


(8, 10) 


4 


M'(2;3,7) 


(11, 12) 


3 


w(2; 3, 8) 


(14, 14) 


1 


w(2;3,9) 


(16, 16) 


4 


w(2; 3, 10) 


(19, 21) 


5 


w(2;3, 11) 


(19, 22) 


5 


w(2; 3, 12) 


(22, 22) 


1 


w(2;3, 13) 


(25, 29) 


5 


w(2; 3, 14) 


(29, 29) 


4 



Conjecture 3.1. |no(B) - n()(B')l t, VB, B' e S{t) with f ^ 3. 
Conjecture 3.2. «o()(B) <t,'iBe S(t) with f ^ 3. 

5.2. Number of 1 's 

In this section, we count T - min{np(EPlS(B)) + nv(EPlS(B)) : B e 5(f)}, as well as minimum and maximum 
values of n\{B) over all B E S{t). The observations in Table|2]lead us to Conjecture 13. 31 and Ouestions l3.1l and l3.2l 



Table 2; Selected good-partitions of ( 1 , 2, . . . , w(2; 3, r) - 1 } 



w{l; 3, t) 


A good partition B corresponding to T 


T 


min|ni(B) : B e S(l)], 
max(ni(B) : B e S(t)] 


H'(2; 3, 3) 


1-001-00 
(2 2) 


1 


(4, 4) 


H.'(2;3,4) 


Pooi'oi-''ooi'oi-'' 

(31313) 


5 


(11, 11) 


H.'(2;3,5) 


001-^00l'01^001''0l' 
(31441) 


4 


(12, 14) 


H'(2;3,6) 


Ol'OOl'OI-'OOl'^OOl' 
(5 5 3 5 5) 


3 


(21, 23) 


H'(2;3,7) 


l'0l'01''0l201^01''0l'001^01^01^0l'0 
(1 142541352 5) 


8 


(33, 34) 


w{2; 3, 8) 


i-'oi-ofoi'oi-'oi-'oi'^ooi'^oi^oi-'oi'oi^oi-oi' 

(4 241435534142 1) 


12 


(43, 43) 


H'(2;3,9) 


l*00l''01-'0l'0l'00l50I*0l'001-'01 '01-^01'* 001* 
(8 63135853136 8) 


5 


(60, 60) 


w(2; 3, 10) 


l'01''0l201^001^00l'01''01*0l'01*01''00l'001-001>'0l'' 
(7 4252748184628 9) 


13 


(75, 77) 


w(2;3, 11) 


0l"'01*00l'0l'°01^001''01'*0l'0l''00l'00l"'0l'00l"'0l"' 
(10 4 6 10 2 9 6 1 9 1 10 6 10 10) 


11 


(91, 94) 


R'(2;3, 12) 


l''01*0l''01-01-'0l'0l'01^0101-'0l"0- 
l"01-'010l20l'0l'01-'0l"0l'01''*0l' 
(9 892317213 11 11 312713298 9) 


17 


(112, 112) 


w(2;3, 13) 


1'01''01'-01-'001"001''01'"01^01-'01"01'0 
l''0l''01-01^0l'0l'°0l'00l'01 '^01^01^01- 
(1 6 12 4 11 6 10 2 4 11 1 6 9 2 3 7 10 1 5 12 5 4 2) 


15 


(130, 134) 
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Conjecture 3.3. ni(B') s$ ni(B) ^ w(2;3,t- l) + 2t, VB e 5(f), B' e S{t- 1). 

Question3.1. For each positive constant c does there exist a f' such that for all f ^ f', np(EPlS(B))+«v(EPlS(B)) ^ ct, 
it ^ 3) VB 6 5(f)? (We conjecture yes). 

Question 3.2. Is there a good partition B e 5(f), (f ^ 4) with 3 consecutive numbers equal in EPIS(B)? (Note that, 
for f = 3, the partition 1 '01 '001 '01 ' has four consecutive exponents, which are the same.) 

4. Palindromes 

Recall: 

1. for given k e'H (the number of "colours"), 

2. to, ... , tk-\ (the lengths of arithmetic progressions), 

3. and n e N (the number of vertices) 

we consider block partitions (Pq, . . . , Pu-i) of {!,...,«} (that is, Pq U ■ • ■ U = {1, . . . , m) and P, n Pj - § for 
i + j; the P, may be empty) such that no P, contains an arithmetic progression of length f, — these are the "good 
partitions", and w(k; to, ... , fj^_i) e N is the smallest n such that no good partitions exists^ If (Pq, . . . , Pjt-i) is a good 
partition w.rt. k, to, . . . , t^-i and n, then for 1 ^ n' < n we obtain a good partition w.r.t. k,to,. . ., tk-\ and n' from P by 
just removing vertices n' + 1, . . . , n from their blocks. Thus w{k; fo, ■ . • , fj-i) completely determines for which n € N 
good partitions exist, namely exactly for n < w{k; to, . . . , fn). 

Now if (Po, . . . , Pk-\) is a good partition w.rt. n, then also (Pq, . . . , P^_[) is a good partition w.rt. n, where every 
vertex v e { 1 , . . . , n) is replaced by v' :- n+\ -v. So it is of interest to consider self-symmetric partitions (with P' = P), 
and we call such partitions good palindromic partitions, motivated by the fact that a partition as a string of numbers 
from {0, . . . , fe - 1 ) is palindromic iff the string is a palindrome (reads the same forwards and backwards). For example, 
01^001^0 represents a good palindromic partition for = 2, fo = fi = 3 and n - 8 (namely ({1,4, 5, 8), {2, 3, 6, 7))), 
and so does ({1,3, 6, 8), {2, 4, 5, 7)), represented by OlOl^OlO, while ({1,2, 5, 6}, {3,4, 7, 8)), represented by OOl^OOl^, 
is a good partition which is not palindromic. 

For given k and fo, • . ■ , ft-i again we want to completely determine (in theory) for which n do good palindromic 
partitions exist and for which not. The key is the following observation (with obvious proof; it will follow also from 
Lemmas 14211431 1. 

Lemma 4.1. Consider fixed k,to, . . ., tk-\, and n ^ 3. From a good palindromic partition (Po, . . . , Pk-i) w.rt. n we 
obtain a good palindromic partition (Pq, . . . , P^_i) w.r.t. n - 2 by removing vertices 1, « and replacing the remaining 
vertices v by v - 1, that is, PJ := {v - 1 : v e P,- \ {!,«)}. 

Corollary 4.1.1. If for n there is no good palindromic partition, then there is no good palindromic partition for all 
n + 2 ■ /, / e No. 

Since by van der Waerden's theorem we know there always exists some n such that for all «' ^ n no good palin- 
dromic partition exists, we get that the existence of good palindromic partitions w.rt. fixed fo, . . . , ft-i is determined 
by two numbers, the endpoint p of "always exists" resp. q of "never exists", with alternating behaviour in the interval 
in-between: 

Corollary 4.1.2. Consider the maximal p e No such that for all n ^ /? good palindromic partitions exist, and the 
minimal e N such that for all n ^ q no good palindromic partitions exist. Then - p is an odd natural number, 
where no good palindromic partitions exists for p + I, but p + 2 again has a good palindromic partition, and so on 
alternatingly, until from q on no good palindromic partitions exist anymore. 



= {!,... I, No = NU{0) 
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Definition 4.1. The palindromic van-de r-Waerden number pdvi'(fe; fo, ■ ■ • , fi-i) e Nq is defined as the pair {p, q) such 
that p is the largest /? € No with the property that for all 1 ^ n ^ p there exists a good palindromic partition 
w.r.t. n, while q is the smallest q e N such that for no n ^ ^ there exists a good palindromic partition w.rt. n. We 

use pdw(^; fo, ■ ■ ■ , f/t-Oi - p and pdH'(A:; fo, • ■ ■ , f/t-Oi = So ^ pdH'(^; fo, ■ ■ ■ , < pdw(k',to, ■ ■ ■ ,tk-i)2 ^ 

w(k;to,...,tk-i). 

The palindromic gap is 

pdg(^; fo, . . . , tt-i) := ^(fe; fo, ...,tk-i)- pdw(k; fo, . . . , fn)2 e No, 

while the palindromic span is defined as 

pds(fe; fo, . . . , tk-i) := pdw(^; fo, . . . , fn)2 - pdw(^; fo, . ■ . , fn)i e N. 

To certify that w(A:; fo, . . . , f/t-i) = n holds means to show that there exists a good partition for n - 1 w.r.t. 
(fo, ■ ■ • , h-i) and to show that there is no good partition for n w.r.t. (fo, . . . , f/t-i). For palindromic numbers we need to 
double the effort: 

Corollary 4.1.3. To certify that pdw{k; to, ... , tk-\) - (p, q) holds, exactly the following needs to be shown: 

• there are good palindromic partitions for p - I and q - I w.rt. (fo, . . . , f/t-i); 

• there are no good palindromic partitions for p + I and q + I w.r.t. (to, . . ., f^-i). 

4.1. Palindromic vdW-hypergraphs 

Recall that a finite hypergraph G is a pair G = (V, E), where V is a finite set (of "vertices") and £ is a set of subsets 
of V (the "hyperedges"); one writes V(G) := V and E{G) := E. The essence of the (finite) van der Waerden problem 
is given by the hypergraphs ap(f , n) of arithmetic progressions with progression length f € N and the number n e No of 
vertices (where the arithmetic progressions of length f in N are the subsets of the form {a + i- fi?),£{o,...,r-i) for a,d &N): 

• y(ap(f,n)) := {!,..., n] 

• £'(ap(f, n)) :- {p c {!,...,«) : p arithmetic progression of length f}. 

For example ap(3,5) = ({1, 2, 3,4, 5}, {{1, 2, 3}, {2, 3,4), {1, 3, 5}, {3,4, 5}}). The diagonal vdW-number w(;t; f , . . . , f) 
for k,t e N now is the smallest « e N such that the hypergraph ap(f, n) is not fc-colourable, where in general a 
A;-colouring of a hypergraph G is a map / : V{G) — > {1, . . . , ^} such that no hyperedge is "monochromatic", that 
is, every hyperedge gets at least two different values by /. We see that the SAT-encoding of "w(2; 3, f) > n ?" as 
discussed in Subsection [TTT] exactly consists of the two hypergraphs ap(3, n) and ap(f, n) represented by positive resp. 
negative clauses. 

The task now is to define the palindromic version pdap(f, «) of the hypergraph of arithmetic progressions, so 
that for diagonal palindromic vdW-numbers we have that pdw(^; f, . . . , f) is the smallest n e N such that the hy- 
pergraph pdap(f, n) is not fc-colourable, and such that for two-coloured problems (i.e., k - 2) the SAT-encoding of 
"pdw(2; fo, ti) > n ?" (satisfiable iff the answer is yes) consists exactly of the two hypergraphs pdap(fo, n), pdap(fi , n) 
represented by positive resp. negative clauses (while for more than two colours generalised clause-sets can be used; 
see S). 

Consider fixed f € N and n e No. Obviously pdap(f, 0) :- ap(f, 0) - ({},{}), and so assume n ^ 1. Let the 
permutation m (like "mirror symmetry") of {1, . . . , n} be the map v v' := n -H 1 - v as discussed in the introduction. 
As every permutation, m induces an equivalence relation ~ on { 1 ,...,«} by considering the cycles, which here, since m 
is an involution (self-inverse), just has the equivalence classes {1, . . . , n}/~= {{v,/(v))j,,eji . of size 1 or 2 comprising 
the elements and their images. Note that m has a fixed point (an equivalence class of size 1) iff n is odd, in which 
case the unique fixed point is ^±1. The idea now is to define m' : {!,...,«) — > {!,...,«}, which chooses from each 
equivalence class one representative (so m'{v) e [v,m{v)} and v ~ w o m'(v) = m'(w)), and to let pdap(f, n) be the 
image of ap(f, n) under m' , that is, (m'(V(ap(f, n))), {m'{H)]HeE{iip{i,n)))- Naturally we choose m'iy) to be the smaller of 
V and m(v'). Now it occurs that images of arithmetic progressions under m' can subsume each other, and so we define 
pdap(f, n) as the image of ap(f, n) under m', where also all subsumed hyperedges are removed (so we only keep the 
minimal hyperedges under the subset-relation). 
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Definition 4.2. For f € N and n e No the hypergraph pdap(f, «) is defined as follows: 



. y(pdap(f,n)):={l,...,rfl) 

• £'(pdap(r, n)) is the set of minimal elements w.r.t. c of the set of m'^(H) for H e E(ap{t,n)), where m'^ : 
{I, . . .,n] —> y(pdap(f, n)) is defined by m^,(v) := v for v ^ [|] and m'„(v) := « + 1 - v for v > [|]. 

Usingap(3,5) = ({1,2,3,4,5),{{1,2,3),{2,3,4},{1,3,5), {3,4,5}}) as above, we have m'({l, 2, 3}) = {1,2,3}, 
m'({2,3,4}) = {2,3},m'({l,3,5}) = {1, 3} and m'({3,4, 5}) = {1, 2, 3}, whence pdap(3, 5) = ({1, 2, 3}, {{1, 3}, {2, 3}}). 

Lemma 4.2. Consider f € N and n e Nq. The hypergraph pdap(f, «) is embedded into the hypergraph pdap(f, n + 2) 
via the map e : y(pdap(f , n)) — > y(pdap(f, n + 2)) given by v v + 1 . 

Proof. First we note that |y(pdap(f, n + 2))| - |y(pdap(f, n))| + 1, and so the range of e is y(pdap(f, n + 2))\{l}. LetGbe 
the hypergraph with vertex set y(pdap(f, n + 2)) \ { 1 }, whose hyperedges are all those hyperedges H e £'(pdap(f, n + 2)) 
with 1 i H. We show that e is an (hypergraph-)isomorphism from pdap(f, «) to G, which proves the assertion. 

Now obviously the underlying hypergraph ap(f, n) is embedded into the underlying ap(f, n + 2) via the underlying 
map V e y(ap(f, n)) v + 1 e y(ap(f, n + 2)), where the image of this embedding is given by the hypergraph with 
vertex set V(ap(t, n + 2)) \ {1, n + 2}, and where the hyperedges are those H e E{ap(t, n + 2)) with 1, n + 2 i H. Since 
m^^2(" + 2) = 1 and m^(v) = ^w^+i^^ + 1) - 1 for v E {1, . . . , n}, the assertion follows from the fact that there are 
no hyperedges H,H' e £'(ap(f,n + 2)) with H n {l,n + 2} 0, //' n {1, « + 2} = and m'^^^(H) c m[^^^{H') (thus 
m'^^j(H') can only be removed from pdap(f, n + 2) by subsumptions already at work in pdap(f, n)), and this is trivial 
since 1 e m'^^^iH) but 1 i m[^^^{H'). □ 

The SAT-translation of "pdH'(2; fo, t\) > n ?" is accomplished similar to the translation of "h'(2; fo, t\)> n ?", now 
using pdap(fo, n), pdap(f i , n) instead of ap(ro, n), ap(f i , n): 

Lemma 4.3. Consider fo, t\ e N, fo ^ fi, and n e No- Let the boolean clause-set FP'^(fo, t\,n) be defined as follows: 

• the variable-set is {1, ... , [f 1} {- y(pdap(fo, n)) - y(pdap(fi, «))); 

• the hyperedges of pdap(fo, n) are directly used as positive clauses; 

• the hyperedges H of pdap(fi, n) yield negative clauses {v>},,g//. 

Then there exists a good palindromic partition if and only if FP''(fo, f i , n) is satisfiable, where the satisfying assignments 
are in one-to-one correspondence to the good palindromic partitions of {1, . . . , n} w.rt. (fo, fi). □ 

For more than two colours. Lemma 1431 can be generalised by using generalised clause-sets, as in fi^, and there 
one also finds the "generic translation", a general scheme to translate generahsed clause-sets (with non-boolean vari- 



ables) into boolean clause-sets (see also Il38ll39[l ). 



4.2. Precise values 

See Subsection l5. ll for details of the computation. 



Table 3: Palindromic vdW-numbers pdw(2; 3, f) 



f 


pdH'(2; 3, f) 


pds(2; 3, f) 


pdg(2;3,f) 


3 


(6,9) 


3 





4 


(15,16) 


1 


2 


5 


(16,21) 


5 


1 


Continued on Next Page. . . 



13 



Table 3: Palindromic vdW-numbers pdw(2; 3, f) 



t 


pdw(2; 3, t) 


pds(2; 3, t) 


pdg(2;3,f) 


6 


(30,31) 


1 


1 


7 


(41,44) 


3 


2 


8 


(52,57) 


5 


1 


9 


(62,77) 


15 





10 


(93,94) 


1 


3 


11 


(110,113) 


3 


1 


12 


(126,135) 


9 





13 


(142, 155) 


13 


5 


14 


(174,183) 


9 


3 


15 


(200,205) 


5 


13 


16 


(232,237) 


5 


1 


17 


(256,279) 


23 





18 


(299,312) 


13 





19 


(338,347) 


9 


2 


20 


(380,389) 


9 


> 


21 


(400,405) 


5 


> 11 


22 


(444,463) 


19 


> 1 


23 


(506,507) 


1 


^9 


24 


(568,593) 


25 


> 


25 


(586,607) 


21 


^49 



4.3. Conjectured values and bounds 

For 26 ^ f ^ 39 we have reasonable values on pdH'(2; 3, t), which are given in Table|4] and which we believe to be 
exact for f ^ 35. These values have been computed by local-search methods (see Subsection 15. 2l i. and thus for sure 
we can only say that they present lower bounds. We obtain conjectured values for the palindromic span (which might 
however be too large or too small) and conjectured values for the palindromic gap (which additionally depend on the 
conjectured values from Subsection l2.2l while for f ^ 31 we only have the lower bounds from Subsection l2.3) . 

Table 4: Conjectured paUndromic vdW-numbers pdw(2; 3, t) 



t 


pdw(2;3,f) ^ 


pds(2; 3, t) ~ 


pdg(2;3,f)~ 


26 


(634,643) 


9 


84 


27 


(664,699) 


35 


71 


Continued on Next Page. . . 



14 



Table 4: 



Conjectured palindromic vdW-numbers pdw(2; 3, f) 



t 


pdw(2;3,f) ^ 


pds(2; 3, t) ~ 


pdg(2;3,f)~ 


28 


(728,743) 


15 


84 


29 


(810,821) 


11 


47 


30 


(844,855) 


11 


48 


31 


(916,931) 


15 





32 


(958,963) 


5 


44 


33 


(996, 1005) 


9 


59 


34 


(1054,1081) 


27 


63 


35 


(1114,1155) 


41 


50 


36 


(1186,1213) 


27 


45 


37 


(1272, 1295) 


23 


44 


38 


(1336,1369) 


33 


10 


39 


(1406,1411) 


5 


8 



4.4. Certificates 

Table |5] gives good palindromic partitions for ni - l,n2 - 1 with (ni, n2) = pdw(2; 3,t), 3 ^ t ^ 39, according 
to the values in Tables [3] E] Due to the palindromic property, for the corresponding n we only show the partition of 
{1, . . . , rf 1), so that for example the good palindromic partition 01^001^0 for f = 3 and n = 8 is compressed to 01^0. 
Note that such partitions correspond exactly to the solution of F'"^i3, t, n) as defined in Lemma l43l 



Table 5: Good palindromic partitions for pdvt'(2; 3, t) 



1 


pdvv(2; 3, r) - 1 


Good palindromic partitions 


3 


(5,8) 


1=0, lOM 


4 


(14, 15) 


OlOl-'O, 1=010- 1= 


5 


(15,20) 


I^OIOI^O 


6 


(29, 30) 


1=01^0=1^01, 1^0=150=1 


7 


(40, 43) 


01''0101-'0=l501, 0=1*01=01=01*01 


8 


(51,56) 


l'01=0l'0=101''0l\ 1=01=01*0101''0P01=0 


9 


(61,76) 


1*01^0= 1=01=01=01*01, 1*0= 1*^0^0101^0= 1=01* 


10 


(92, 93) 


l''01*010=l'0=10=l''01*, l''01*0=l=0=l'^0= 1*01*01 


11 


(109, 112) 


1^0POl''010= 1*0= 1^0101*01*01*, l=010l'0101*0l'0=l=0l'0l'^010l"'01 


12 


(125, 134) 


1 "0101*010= 101 '"01*01=01*0101=, i'oi*oi''oi=oi-^oioroi=oioPoi"o 


13 


(141, 154) 


101=01 "01 '»010= 1*01 '»01 "010= lf'01=, 1 "'01*01 "01*01 '»010=l-'0= 101 '"01*01=01* 


Continued on Next Page. . . 
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Table 5: Good palindromic partitions for pdwCl; 3, 



( 


pdw(2; 3, /) - 1 


Good palindromic partitions 


14 


(173, 182) 


1^0l'0l''01-'0l'0l'0l'"010l'"0l'010l''01^0^l', i-'o^i^oi*oioi'oi*oi^oi*oi'oioi*oi-o-i'-'oi^ 


15 


199 
204 


l50l"'0l'0^1''01^01^0l''01*0l"'0l'0l'^0l501^01''01 
I"01^01''0l201"0l30l50l202l201"0101"'01'30l2 


16 


231 
236 


I"'01''01''0|50l202l''0l'201-^oroi202li2oi8oi5oi'»01 

i'-'oi'oi-o-i-'oi'oio'i'*oi'-'oi''o-i"oi'oi^oi'oi-^ 


17 


255 
278 


1"01"'0101^01''0101^01''0P01''01^01"0101'01''0P021* 
I''01<'0l501'01"'01^0l501*0^1"'0101^01''0101*01i^0101i^01'0^1 


18 


298 
311 


l20l201i^01^0101i^02l''01"0P0l50101i30P01^01^0l501i202li001' 
1*'01*01'01"0^1'0^1'^01"'01'01'0^10^1''01'01'01''01"'01"'01*' 


19 


337 
346 


I"01^010l'=02p0l20l"'0l"0^ 101*010^1 "01 "'01^0P02101"01^01"'01^ 
I"0l'*0l'0l0l''0l'0l0l'0l20l"*0l''01()l2o^l'<01-'01^010l'2oi-010l'-0l'^0l5 


20 


379 

3S8 


1 ""O- 101 '"01 '*0|-'01'M)- 1-0|-"01 '-"01 '"0101 "'01 '0101 "'Ol-OI'O- 1 '''0101-01' 

i'''(ii''(ii'oi"Oioi'oi''oi"oi'()ioi"o-ro-i()r'oi'"or'()Poioi-oi-'oi'"oi"'oi 


21 


399 

404 


1^0101*0^1"0P01''01^01"'01"01''01^01'01"'01^0^1''01*01^01'^01'^0^1"0101'01*01'' 
1"*0101'0^1'01"01'0^1'''0101>'0101"01'0101"'01>'0101™0101'01"'0101"01 


22 


443 
462 


l301«0l'0l''0l"01*0l''010l'^0l'010l''0l"'0^l"*01^01^0101^''010l"'0l"0P01^0101'' 
liSoii7oi6oi4oi7oio2l5oii5oiiOoi"02l01<^O^l201«01"02l'01''01i''Ol2'01i''01"01''02l9 


23 


505 
506 


lll0l220l2O02l90l40l802l60l2202l302l60ll00lll0l20l"0ll''01<'02l302l220l60l90l40ll00llO 
I2202il80l20ll90l30l20l802ll80l90ll20l202l90ll90l2010l60ll00l40ll00ll90102l60ll40ll0 


24 


567 
592 


0ii8oi8o2ii5oi8oi2io2i8oii6oio2i6oi2oii4oii0oi20oii''0l"0l201*'0^10li^01*0^l'i0l'0^ 

I'^O^l^Ol^ 

121011801160140170160101140130218017013012012002130170115017013021200120130170190118 

OlOl^'Ol^'Ol'OI^ 


25 


585 
606 


ll9oi9oil8o2j21ojl5o2jojlOojl4ojl3oj5oj9oio2j8oj6oj8o2j(n9oi5Qjl3(nl4ojlOojo2jl5oj21 

O^l^d'Ol^ 

1240140110011701801401230101120150112019021301801«012101^01»01'021'01"0150112010123 

OfOl^O^l^Ol'^Ol 


26 


^ 633 
^ 642 


P0l'^01''0l'20l2''0l2i0l''0l'«oroi''0l'02l'20l"'02l''0l2oi'"0l25oi2oi''02l»0l'2o2 
l"'01''01^0li301''0l'0l'*01 

115oi23qi6qj13oi4qi14oi2oi13oiiioi15o2i6oi12oi5oi7oi14o2iiio2ioi13o1''01"01''01"01<' 
0l''01*01''01^0l"0l"01^ 


27 


^ 663 

> 698 


118oi18oi22oi5oi13o2i21oi4oi9o2i2oj8oi2oj3oj8q2j12qj18oi9oi14qi5oi9oi18oi12o2 
112qi14o2i9oi2oi25oi4oi4 

l22oi6oi2SoiiOoi2Soiiio2i9oioiSoi3oi''02l'01i''01^'Ol30^1"01"0^l30l5oii2oiii01'0 
I'D^l'M^iOli^Ol'^Ol^Ol 


28 


^ 727 
^ 742 


l26oi5oil3oi8oil9oil3oilOoil9oil2oilSoi7o2ll2Qiml2o2i4Qi9oi2oil5oi22oil5oi2oi9o 

l50l''010l"0l"01*0l''01''01«'0l2 

1210122010140125011001012301110219017013017021901140127021190117021301501120111017 

0l''02l'0l2i0l'20l'^01*01 


29 


^ 809 
^ 820 


l220l2''0l2»0l"0P01''01''0l"'0l220l"'0l'i0li202l20l'0l50l'"0l"0P0l50l220 1 0P0l2''0 
Iii0l301*0l'0lii0li20lii0l20l2*02l30li2 

1130102116011501140101401201270140211401120116016017011001701601160112012001901401210 


Continued on Next Page. . . 
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Table 5: Good palindromic partitions for pdwCl; 3, 





pdw(2;3,0- 1 


Good palindromic partitions 






1"01'01''01"01''0M"01™01''0101'01»01'' 


30 


^ 843 
^ 854 


1290 1 25 q2i17oi9oi5o2i22oj7oi10,)i9oi4,)i9,)2i17,)iI3,)i8oi28oi5,)i10oiqi10oi5o1801WO 
ll2Qi24oil3oi4oilOoi2oil5oi9oio2il5oill 

I2'0101^0101'01''01i*^01i801»0l2i02l''0l30l201"01"01''01i*0l301"01^01''01i°01'0l2501i<' 
0^1''01"*0^1''01"01^0101''01^01^'01''01'01" 


31 


^ 915 
^ 930 


li^01^01''0l'0l"01*0l"^0l"'01''01^01''0l"01^'01<'01^<'0l'0101^01*0l"'010^1^'0l'0l'*0 
1201'<'0120101'01^01''01301^01201012S012<'01"0101'<'015 

11201190116011001901301401140112011401180211001401 130180150101 H'Ol"01>50l"S01«Ol501« 

0101''0l''0l'''0l"'0l50l'*01^01^50l"'0l"'01^0l''0l'0l"0l"01 


32 


> 957 
^ 962 


1^01'01''01"01*'01''01"01™0^1«01'01^''0^1"01"01''01"01"01'''01"01'01"0^1'0 

l-'0l'-0l"0l'0'l'01--'01''0l'0r01^-0l'0l'^01^°01^01<'01 

l'U)-l'*()l-U)l '()|-()li'()l'''()l()l''H)l' '()i n)l--t()l 5()l-()l-"()l '-()!-'()- l 

01 '"oi''()i '''or'oi-'oi~oi-oi-"()- i-'Hii-'oi'^oi '''oi- 


33 


^ 995 
^ 1004 


I"010l2<'0l'0^l'2oi''01»0l'0l''01*0l"0|-'0l'-'0l"'0l'2oi"01^010l"'02l''0l'01^i0l"01^* 
010l"0P0^l"0P"0l'0101^*0P^01^01*0l50l'0l''0P0l" 

I201^0l2201''0l320l20l902l01''0l50l2301i«01™0102li^01"01'01*^01'01i'01"01''01"01i"0 

l26o2llSoilOoi23oil2oil5oi9oil2oi6o2lSo2l3oi31oil2 


34 


^ 1053 
^ 1080 


1'01"01^'0101''01"01*01^01'^01'01''01'^01"0101^01''01'01''01'01'^0101"01'01'^01*' 
0l''0^l'01''0l''0l'010l'*'01''0l''0l'01^0l'0^1^^0101*0l"'0l' 

I23o2l9oi22oil3oi25o2l21oi8oil5oill0ll802l''0l20010l20l50l2101*01*0|2-^01''01«01" 

0l'°01^°01*0-l-0l"0l"'01''01^01-'''0101^=01*0l''0l"01*0-l'20l2'0-l'0l' 


35 


^ 1113 
^ 1154 


l'0l"01^'01^*01^0l"0101^^01''010l"'0l''0l"0l2*0l'0l'201^'01^'02l0l'0P0l"01^0l2^0 
I'0l"0l''0l'010^10l50l"0l"'01^'010l'0l'0l'01^'0l"01™01^=0l"'0l'' 

12801S0118016019012901601901601120150101100125012801130180190130125012001301210150 

l"'01^01^''010l'01^0l''0l"01^0l"0l"01*0^1'50101^0^l"01^''01^'0l' 


36 


^ 1185 
^ 1212 


1*'01^*01"'01'*'01^*01''01''01''01^'0101^01*01^01'01=01'=01^0101"01™01''01'^01'01"0 

l''0l"0l'01''0l'*010l'0l'01^01''01'^01^'0l'0|-*0|-''0l"()l-'0-101-'-()l'0l"0l'"0l'5 

l30l2501"OP''Ol501''01"01"01i»OlW«0102p2oiiooii3oi20oioi4oii2oioiiJoi2"0101*01''Ol32 
010l2801''01"0l3^01^01'«0l201^02l301^01«01™01i301"01'0l2301''0lS 


37 


^ 1271 
^ 1294 


l^^OlOOl'^Ol'^Ol^'OlO-l'^O^l^^Ol'Ol-'Ol^Ol-^Ol '-^01^-0- I'^OPOlO^l^Ol^'Ol'f'O^ 101'" 
01^^0l"0l''0l'0l''0l'^0l'01^''0l"0l5010l"010^P'01™0^1''0l'^0l"0l'0l'01^01''0l" 

I70l240l3l0l60l30l30010l30l40l220l34010l80l320l80l220l20l70l20l30l230l60l220ll0010l4 

01'01'''01''01301'01'=012''01"0130101201"'01220128011201801">01'*0101*0122012301''01'201'' 


38 


^ 1335 
^ 1368 


l"'0p50l'^01*0l'^0l'010P^0l"010l50l"01^0l'01^^0P''0^l"0l'01^''0101''0P'0l"0101'' 
01*0^101^''0l''010l''0l'*0l'0^1*'01^'0l'Wl"01^*0101^'0^l"0l'0l'^0l'0l'' 

I"0l'01»'0l'2o2,3o2i8oi20oioiiioi33oi4oii(>oi8oi2«0l'01<0l2''0l"'0l22oi't0l801<0l"0is 
01''0l"0l"0l2*'0l"0l"01''0l220l'01"01*0l''02l''0l"0l2''0l"*0l220l30l'02l-=0l"i0l2'01 


39 


> 1405 
^ 1410 


101"S0l501'S0l201'S0l3301''01i°02l2*01'S01''0P'01i*01'S01"'010l3f^0n220l330101"0l20101i'0p2 

01-''01*0l"0l2i0' 101 '-^01^01-^-01 "01 "01*01 "01^''0-l'=01-^f'0l"01'''0l'''01-0l'010^1 "01 "01* 

I30l 340 l240l30l 320 l40ll802l 350ll20l70ll60lll0l60l''0l2'>01«0l''0l"0l50l"'010l230il602l31 

0l301i^01''01'01<'0l2S01"01"0102l'01i«01*'02l*0l201"0l30l320l501i^0l3''010l2''01i501'30l301' 



4.5. Open problems 

• It seems the palindromic span can become arbitrarily large — also in relative terms? Perhaps the span shows a 
periodic behaviour, oscillating between small and large? 
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• Similar questions are to be asked for the gap. Does it attain value infinitely often? 

• Properties of the hypergraphs pdap( f,n) seem to be of interest and non-trivial, starting with the estimation of 
the number of hyperedges and their sizes. 

• Which structure of the palindromic problems makes them more amenable to conflict-driven SAT solvers than 
ordinary vdW-problems (see Subsection lS.H ? 



4.6. Remarks on the use of symmetries 

The heuristical use of symmetries for finding good partitions has been studied in 1 18, 2^ T^. Especially we find 
there an emphasise on "internal symmetries", which are not found in the problem, but are imposed on the solutions. 

The good palindromic partitions introduced in this section are more restricted in the sense, that they are based 
on the symmetries m from Subsection 14. II of the clause-sets F expressing "w(fc; fo, fi, . . . , f<r-i) > n ?" (i.e., we 
have m{F) - F; recall Subsection 1 1.1b . which then is imposed as an internal symmetry on the potential solution by 
demanding that the solutions be self-symmetric. In |18] "reflection symmetric" certificates are mentioned, which for 
even n are the same as go od p alindromic partitions, however for odd n they ignore vertex 1, not the mid-point [fl as 
we do. This definition in lll8ll serves to maintain monotonicity (i.e., a solution for n + 1 yields a solution for n, while 
we obtain one only for n-1 (Lemma [4. II ). We believe that palindromicity is a more natural notion, but further studies 
are needed here to compare these two notions. 

Further internal symmetries used in Lisl Eol [l9l are obtained by modular additions and multiplications (these are 
central to the approaches there), based on the method from [43] for constructing lower bounds for diagonal vdW- 
numbers. No generalisations are known for the mixed problems we are considering. 

Finally we wish to emphasise that we do not consider palindromicity as a mere heuristics for finding lower bounds, 
but we get an interesting variation of the vdW-problem in its own right, which hopefully will help to develop a better 
understanding of the vdW-problem itself in the future. 



5. Remarks on the use of SAT solvers 



We conclude by summarising the experimental results and insights gained by running many SAT solvers on the 
many instances considered in this paper. 



5.7. Complete solvers 

Complete SAT solvers exist in mainly two forms, "look-ahead solvers" and "conflict-driven solvers"; see li^ 21 1 



for general overviews on these solver paradigms. In general, for (ordinary) vdW-problems look-ahead solvers seem to 
perform better than conflict-driven solvers, while for palindromic problems it seems to be the opposite, however this 
might depend on the parameter setting, and with confidence we can only speak for the two problem classes considered 
in this paper. Considering publically available general-purpose SAT solvers: 

• satz ([41], version 215), DKsolver (the version from 2002) and march_pl for look-ahead solvers. 



• and minisat (MM) versions 1, 2, 2.2.0, picosat version 913, precosat versions 236, 570, glucose version 
1.0, and cryptominisat, version 2.9.0, 

the overall "winner" on vdW-problems regarding w(2;3,f) amongst these solvers is satz (while from the conflict- 
driven solvers minisat-2 .2.0 and glucose perform best)0 We only considered t ^ 16, where on a current laptop 
for f = 13 satz takes about 10m, about 2h for t - 14, about 20h for t = 15, and about 200h for f = 16. If we assume a 
running-time factor of 10 for increasing ? by 1, then for t - 19 it would take about 30 years (on a single medium-fast 
processor), which is better than the 200 years as reported in Section |2] (using a specialised, but more simple SAT 
solver), however this is hkely misleading, due to the quadratic growth of the number of variables. 



'^We need to point out that this only holds for vdW-problems considering w(2; 3, t). As already demonstrated in L36,] . with every parameter 
change a new game begins, and solvers behave very differently. 
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Showing w(2; 3, 20) = 389 (recall Subsection |2j2j seems to require new (algorithmic) insight. Since look-ahead 
solvers are good on this problem class, the theory of optimising heuristics as outlined in ll34ll is applicable, and should 
push the frontier. 

The situation changes for palindromic vdW-problems, where conflict-driven solver perform much better than look- 
ahead solvers. Minisat-2 .2.0 and precosat570 are clearly the strongest amongst the solvers we considered (the 
same as above), where perhaps for the largest problems precosat570 gets the upper handl For f = 25 and the 
hardest problem n - 608 it took about 10 days to determine unsatisfiability. 

We remark that for pdw(3, t) with odd n we can determine that the middle vertex ^ can not be element of the 
first block of the partition (belonging to progression-size 3), since then no other vertex could be in the first block (due 
to the palindromic property and the symmetric position of the middle vertex), and then we would have an arithmetic 
progression of size t in the second block. Due to this (and there might be other reasons) palindromic problems for odd 
n are easier (running times can go up by a factor of 10 for even n)0 

Using the new method developed in OTIl (see also ll'M ). meanwhile we were able to verify the conjectured values 
for pdw(2; 3, (recall Subsection I4.3l l for t - 2b and t - 21 (still on a single computer!). In general it would be 
very interesting to find out why look-ahead solvers are good on ordinary vdW-problems and conflict-driven solvers 
on palindromic vdW-problems. 



5.2. Incomplete solvers (stochastic local search) 

In the OKlibrary we use the Ubcsat suite (see |49]) of local-search algorithms in version 1-2-0. The considered 
algorithms are GSAT, GWSAT, GSAT-TABU, HSAT, HWSAT, WALKSAT, WALKSAT-TABU, WALKSAT-TABU- 
NoNull, Novelty, Novelty+, Novelty-i-+, Novelty-i-p, Adaptive Novelty+, RNovelty, RNovelty+, SAPS, RSAPS, 
SAPS/NR, PAWS, DDFW, G2WSAT, Adaptive G2WSat, VWl, VW2, RoTS, IRoTS, SAMD. The performance of 
local-search algorithms is very much instance-dependent, and so a good choice of algorithms is essential. Our exper- 
iments yield the following selection criteria: 

• For standard problems (Section|2) the best advice seems to use GSAT-TABU for f ^ 23, to use RoTS for f > 23, 
and to use Adaptive G2WSat for t > 33 (also trying DDFW then). 

• For the palindromic problems (Section|4} GSAT-TABU is the best algorithm. 

Finally, using for vdW-problems a solution found for n - 1, and for palindromic vdW-problems a solution found for 
n-2 (according to Lemma l4Tb as initial guess, helps to go much quicker through the easier part of the search space 
(of possible n), and also seems to help for the harder problems. 
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Appendix A. Using the OKlibrary 

The OKlibrary, available at http : //www . ok- sat-library . org" is an open-source research and development 
platform for SAT-solving and related areas (attacking hard problems); see L33.1 for some general information. For the 
purposes of this article the following components are directly relevant: 

• The OKlibrary provides an already rather extensive library of functions for the computer algebra system 
Maxima. For example all hypergraph generators discussed in this article, and all vdW- and palindromic vdW- 
numbers can be computed and investigated at this level. 

• For computations which take more time, C++ implementations are available. 

• The OKlibrary provides easy access to a large number of (original) SAT solvers and related tools (as "external 
sources")|Zl 

• Finally these components are integrated into tools for running and evaluating experiments]! 
In the following sections we demonstrate the use of these tools. Some general technical remarks: 

1. The installed OKlibrary lives inside directory OKplatf orm. 

2. Any where inside this directory the Maxima-installation is called via oklib — maxima on the (Linux) command- 
line. 

3. The C-H-i- programs as well as the external sources, here the various SAT solvers, are placed on the path of the 
(Linux) user, and are thus callable by their name on the command-line (anywhere). 

Appendix A.l. Numbers and certificates 

All known vdW-numbers and palindromic vdW-numbers and known bounds are available at the computer-algebra 
level in the OKlibrary (using Maxima). For example the (known) numbers w(2; 3, t) and pdw(2; 3, f) are printed as 
follows (where inside the OKlibrary we typically use the letter "k" for the length of an arithmetical progression, not 
"t" as in this article): 

OKplatform> oklib — maxima 
(7.il) oklib_load_all() ; 

C/,i2) for k : 3 thru 40 do block( [L: [3,k]] , 

print (k,vanderwaerden(L) , pdvanderwaerden(L) , pd_span(L) , pd_gap(L))); 
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[6,9] 3 







4 


18 


[15,16] 


1 


2 


5 


22 


[16,21] 


5 


1 


6 


32 


[30,31] 


1 


1 


7 


46 


[41,44] 


3 


2 


8 


58 


[52,57] 


5 


1 



'The aim is to serve as a comprehensive collection, also maintaining "historical" versions. 
**In general we use the R system for statistical evaluation. 
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9 77 [62,77] 15 

10 97 [93,94] 1 3 

11 114 [110,113] 3 1 

12 135 [126,135] 9 

13 160 [142,155] 13 5 

14 186 [174,183] 9 3 

15 218 [200,205] 5 13 

16 238 [232,237] 5 1 

17 279 [256,279] 23 

18 312 [299,312] 13 

19 349 [338,347] 



20 [389,inf-l 

21 [416,inf-l 

22 [464,inf-l 

23 [516,inf-l 

24 [593,inf-l 

25 [656,inf-l 

26 [727,inf-l 

27 [770,inf-l 

28 [827,inf-l 

29 [868,inf-l 

30 [903,inf-l 

31 [931,inf-l 

32 [1007, inf- 

33 [1064, inf- 

34 [1144, inf- 

35 [1205, inf- 

36 [1258, inf- 

37 [1339, inf - 
[1379,inf- 
[1419, inf- 



38 
39 



9 2 
[380,389] 
[400,405] 
[444,463] 
[506,507] 



9 [0, inf -390] 
5 [11, inf -406] 
19 [1, inf -464] 
1 [9, inf -508] 
[568,593] 25 [0, inf -594] 
[586,607] 21 [49, inf -608] 
[634,643] 9 [84, inf -644] 
[664,699] 35 [71, inf -700] 
[ [728 , inf -1] , [743 , inf -1] ] 



[[810, inf -1] , [821, inf -1]] 
[[844, inf -1] , [855, inf -1]] 
[[916, inf -1] , [931, inf -1]] 
[[958, inf -1] , [963,inf-l]] 
[[996, inf -1] , [1005, inf -1]] 



15,0] 
11,0] 
11,0] 
15,0] 
[5,0] 
[9,0] 



[84,0] 
[47,0] 
[48,0] 
[0,0] 
[44,0] 
[59,0] 



[[1054,inf-l] 
[[1114,inf-l] 
[[1186,inf-l] 
[[1272, inf -1] 
[[1336, inf -1] 
[[1406, inf -1] 



[1081,inf-l] 
[1155,inf-l] 
[1213,inf-l] 
[1295,inf-l] 
[1369,inf-l] 
[1411,inf-l] 



[27,0] 
[41,0] 
[27,0] 
[23,0] 
[33,0] 
[5,0] 



[63,0] 
[50,0] 
[45,0] 
[44,0] 
[10,0] 
[8,0] 



40 unknown unknown unknown unknown 



As one can see, if only bounds are known instead of a precise number x, then the number x is replaced by a pair (a, b) 
with a ^ X ^ b; "inf - 1" here indicates that the number is finite, but no more precise upper bounds are knownjj 
For palindromic span and gap the "0" indicates that these numbers could go up or down0 
Also the certificates (good partitions) are available, in various representations: 

(°/»i4) cf ull_certif icate_string_pdvdw_3k(34) ; 

("/o4) [["l-{9}01-{14}01--[21}0101-{4}01-{27}01-{8}01-{24>01-{32}01-{3}0 
1-{4>01-{16>01-{13}0101-{2}01-{33>01-{3}01-{4}01-{3}01-{16}0101-{13} 
01~-[3>01~-[16}01-{6}01-{33}0-{2}l-{9}01-{4}01-{31}01-{2}0101-{16}01-{4} 
01-{31>01~-[3}01-{2}01-{7}0-{2}l-{22}0101-{8}01-{30}01-{3}"] , 

["l-{23}0-{2}l-{9}01-{22}01-{13}01-{25}0-{2}l-{21}01-{8}01-{15}0 
l~-[ll}01~-Cl8}0-{2}l-{4}01-{20}0101-{2}01-{5}01-{21}01-{8}01-{4}01-{23} 
01~{4>01-{8}01-{21}01-{10}01-{20}01-{4}0-{2}l-{2}01-{27}01-{10}01-{4}0 
1-{2>01-{26>0101-{25}01-{8}01-{9}01-{17}01-{4}0~{2}1-{12}01'-[21>0'{2> 
l-{3>01-{3>"]] 

(y»i5) extract_data_certif icates_pdvdw_3k(34) ; 



'in principle tliere exist theoretical upper bounds, but for practical purposes these bounds are completely useless. 

'"At the time of printing this information we achieved to compute pdvv(2; 3, 26) and pdvv(2; 3, 27), using new methods; see the forthcoming (37 
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(7.o5) [[3,34] ,1054,1081, 

[[10,25,47,49,54,82,91,116,149,153,158,175,189,191,194,228,232, 

237 , 241 , 258 , 260 , 274 , 278 , 295 , 302 , 336 , 337 , 347 , 352 , 384 , 387 , 389 , 

406 , 41 1 , 443 , 447 , 450 , 458 , 459 , 482 , 484 , 493 , 524] ] , 
[[24,25,35,58,72,98,99,121,130,146,158,177,178,183,204,206,209, 

215,237,246,251,275,280,289,311,322,343,348,349,352,380,391, 

396 , 399 , 426 , 428 , 454 , 463 , 473 , 491 , 496 , 497 , 510 , 532 , 533 , 537] ] ] 

With the first command we get the representation of the good partitions as used in this paper (where now for the 
palindromic situation we have two good partition according to Corollary |4T3), while the second command yields a 
list with five elements: first the parameter tuple, then the two components of the palindromic number, and then two 
lists with the good partitions available, now represented via the block in the partition for the second colour 

Analysing the patterns according to Section |3] and applying these measurements to the certificates stored in the 
OKlibrary for 20 < f ^ 39 is done as follows: 

(y.i6) for k : 20 thru 39 do 

print (k , f irste ( vajiderwaerden3k (k) ) , 

inap(ELnalyse_certif icate ,full_certif icate_vdw_3k(k) ) ) ; 

20 389 [[[48,341] , [44,45] , [4,37] , [5,27] , [20,1]]] 

21 416 [[[50,366] , [43,44] , [7,34] , [13,26] , [8,1]]] 

22 464 [[[54,410] , [51,52] , [3,47] , [5,40] , [27,1]]] 

23 516 [[[59,457] , [53,54] , [6,46] , [12,36] , [17,1]]] 

24 593 [[[63,530] , [57,58] , [6,54] , [13,37] , [20,1]]] 

25 656 [[[74,582] , [69,70] , [5,64] , [11,45] , [16,2]]] 

26 727 [[[78,649] , [72,72] , [6,64] , [13,42] , [21,1]]] 

27 770 [[[79,691] , [72,73] , [7,65] , [15,58] , [11,2]]] 

28 827 [[[79,748] , [74,75] , [5,64] , [11,44] , [19,1]]] 

29 868 [[[81,787] , [76,77] , [5,69] , [11,57] , [27,1]]] 

30 903 [[[83,820] , [76,77] , [7,67] , [13,57] , [15,1]]] 

31 931 [[[82,849] , [80,81] , [2,77] , [5,53] , [58,1]]] 

32 1007 [[[87,920] , [82,83] , [5,78] , [9,62] , [29,1]]] 

33 1064 [[[89,975] , [85,86] , [4,80] , [9,58] , [25,1]]] 

34 1144 [[[96,1048] , [87,88] , [9,80] , [19,63] , [23,2]]] 

35 1205 [[[95,1110] , [91,92] , [4,84] , [9,67] , [41,1]]] 

36 1258 [[[101,1157] , [97,98] , [4,88] , [9,72] , [42,1]]] 

37 1339 [ [ [105 , 1234] , [97 , 98] , [8 , 90] , [17 , 65] , [30 , 2] ] ] 

38 1379 [[[104,1275] , [96,97] , [8,91] , [17,73] , [26,1]]] 

39 1419 [[[105,1314] , [98,99] , [7,95] , [13,72] , [46,1]]] 

Per line we print out three items: t, the lower bound on w(2; 3, f) and the list of data for each stored certificate. Now 
currently we have only stored one certificate for each 20 ^ f ^ 39, and thus the third item contains just one list, with 
five pairs for the dififerent statistics^ These five pairs have the following meaning: 

1. First come «o and «!. 

2. Then come the numbers of terms 0' and 1 ' (we don't use "00" here, and so these terms alternate, and thus their 
numbers diff'er at most by one). 

3. Then from these counts the cases with s - I are excluded; thus the first element of the pair is noo- 

4. Now these exponents s are put in the list, and the sums of the numbers of peaks and valleys are computed; again 
for block and block 1 of the partition, and thus now the second element of the pair is tip + tiy. 

5. Finally for these lists of exponents the maximal size of an interval with constant values is computed; thus if 
there were a second element of the pair with value 3 or greater, then Ouestion l3. 21 would have been answered in 
the positive. 



We found more than one solution in each case, but always very similar to the one stored; there seems to be a clustering of solutions, and 
perhaps there is always only one (or very few) cluster. 
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Appendix A.2. Hypergraphs 

The hypergraphs are available at Maxima-level, and the computational expensive pahndromic hypergraph also at 
C++ level: 

(°/.i8) arithprog_hg(3,5) ; 

C/.08) [{1,2,3,4,5>,{{1,2,3>,{1,3,5},{2,3,4},{3,4,5»] 
C/oiQ) arithprog_pd_hg(3 , 5) ; 
C/.09) [{1,2, 3}, {{1,3}, {2, 3}}] 

> PdVanderWaerden-OS-DNDEBUG 3 5 

c Palindromised hypergraph with arithmetic-progression length 3 

and 5 vertices, 
p cnf 3 2 
13 
2 3 

Appendix A.3. SAT instances 

The SAT-instance for considering w{2; 3, f) with n vertices is created by the program call 

VanderWaerdenCNF-03-DNDEBUG 3 t n, 

for example for ? = 4 and n = 6 

> VanderWaerdenCNF-03-DNDEBUG 3 4 6 

> cat VanDerWaerden_2-3-4_6 . cnf 

c Vein der Waerden numbers with partitioning into 2 parts; 

SAT generator written by Oliver Kullmann, Swansea, May 2004, October 2010. 
c Arithmetical progression sizes kl = 3, k2 = 4. 
c Number of elements n = 6. 

c Iterating through the arithmetic progressions in colexicographical order. 



p cnf 


6 9 




12 3 







2 3 4 







13 5 







3 4 5 







2 4 6 







4 5 6 







-1 -2 


-3 -4 





-2 -3 


-4 -5 





-3 -4 


-5 -6 






The SAT-instance for considering pdw(2; 3, t) with n vertices is created by the program call 

PdVanderWaerdenCNF-03-DNDEBUG 3 t n, 

for example for ? = 4 and n = 9 

> PdVanderWaerdenCNF-03-DNDEBUG 3 4 9 

> cat VanDerWaerden_pd_2-3-4_9 . cnf 

c Palindromic van der Waerden problem: 2 peirts, arithmetic progressions of 

size 3 and 4, and 9 elements, yielding 5 variables, 
p cnf 5 10 
12 3 
2 4 
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13 4 
15 

2 5 

3 5 

4 5 
-2 -4 

-1 -3 -5 
-3 -4 -5 

Appendix A.4. Running experiments 

For running Ubcsat-algorithm to determine lower bounds on vv(2;3,f) and pdvy(2; 3, f), also providing "con- 
jectures" on the precise values, we have the following tools (using no parameters here serves to print some basic 
helper-information) : 

> RunVdWklk2 

ERROR [RuiiVdWklk2] : Six parameters kl, k2, nO, alg, runs, cutoff 

are needed: The progression-lengths kl,k2, the starting number nO of 
vertices, the ubcsat-algorithm, the number of runs, eind the cutoff. 
An optional seventh parameter is a path for the file containing an 
initial assignment for the first ubcsat-run. 

> RunPdVdWklk2 

ERROR [RunPdVdWklk2] : Five parameters kl , k2, alg, runs, cutoff 
are needed: The progression-lengths kl,k2, the ubcsat-algorithm, 
the number of runs, aind the cutoff. 

And for running complete solvers on palindromic instances we have 

> CRunPdVdWklk2 

ERROR [CRunPdVdWklk2] : Three parameters kl, k2, solver, are needed: 
The progression-lengths kl, k2 and the SAT solver. 
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